Inherent Characteristics of Traceability Artifacts
Authors
Abstract
This paper describes ongoing work to characterize the inherent ease, or “traceability”, with which a textual artifact can be traced using an automated technique. Software traceability approaches use varied measures to build models that automatically recover links between pairs of natural language documents. Thus far, most approaches use a single-step model, such as logistic regression, to identify new trace links. However, such approaches require a sufficiently large training set of both true and false trace links. True links are by far in the minority, which reduces the performance of such models. Therefore, this paper formulates the problem of identifying trace links as the problem of finding, for a given logistic regression model, the subsets of links in the training set that give the best accuracy (in terms of Gmetric) on a test set. Using hill climbing with random restart for subset selection, we found that, for the ChangeStyle dataset, we can classify links with a precision of up to 40% and a recall of up to 66% using a training set as small as one true candidate link (out of 33) and 41 false links. To get better performance and learn the best possible logistic regression classifier, we must “discard” links in the trace dataset that increase noise, so that the model does not learn from links that are not representative. This preliminary work is promising because it shows that a few correct examples may perform better than several poor ones. It also shows which inherent characteristics of the artifacts make them good candidates for automatically learning efficient traceability models, i.e., it reveals their traceability.
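The subset-selection idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not define the Gmetric, so the code assumes the geometric mean of precision and recall; the logistic regression is a small hand-written gradient-descent version; and all function names, parameter defaults, and the synthetic one-feature "links" are hypothetical and unrelated to the ChangeStyle dataset.

```python
import math
import random


def train_logreg(rows, labels, epochs=100, lr=0.5):
    """Fit logistic regression by batch gradient descent; returns [bias, w1, ...]."""
    w = [0.0] * (len(rows[0]) + 1)
    for _ in range(epochs):
        grad = [0.0] * len(w)
        for x, y in zip(rows, labels):
            z = w[0] + sum(wi * xi for wi, xi in zip(w[1:], x))
            z = max(-30.0, min(30.0, z))  # keep exp() in a safe range
            err = 1.0 / (1.0 + math.exp(-z)) - y
            grad[0] += err
            for j, xj in enumerate(x):
                grad[j + 1] += err * xj
        w = [wi - lr * g / len(rows) for wi, g in zip(w, grad)]
    return w


def predict(w, x):
    """Predict 1 (link) when the linear score is positive, i.e. p > 0.5."""
    return 1 if w[0] + sum(wi * xi for wi, xi in zip(w[1:], x)) > 0 else 0


def g_metric(predicted, actual):
    """ASSUMPTION: geometric mean of precision and recall on the link class."""
    tp = sum(1 for p, a in zip(predicted, actual) if p == 1 and a == 1)
    fp = sum(1 for p, a in zip(predicted, actual) if p == 1 and a == 0)
    fn = sum(1 for p, a in zip(predicted, actual) if p == 0 and a == 1)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return math.sqrt(precision * recall)


def subset_score(mask, train_x, train_y, test_x, test_y):
    """Train on the masked-in links only and score the model on the test set."""
    xs = [x for x, keep in zip(train_x, mask) if keep]
    ys = [y for y, keep in zip(train_y, mask) if keep]
    if len(set(ys)) < 2:  # need at least one true and one false link
        return 0.0
    w = train_logreg(xs, ys)
    return g_metric([predict(w, x) for x in test_x], test_y)


def hill_climb(train_x, train_y, test_x, test_y, restarts=3, steps=40, seed=0):
    """Hill climbing with random restart over subsets of the training links."""
    rng = random.Random(seed)
    best_mask, best_score = None, -1.0
    for _ in range(restarts):
        mask = [rng.random() < 0.5 for _ in train_y]  # random starting subset
        current = subset_score(mask, train_x, train_y, test_x, test_y)
        for _ in range(steps):
            i = rng.randrange(len(mask))
            mask[i] = not mask[i]  # neighbour: add or drop one link
            candidate = subset_score(mask, train_x, train_y, test_x, test_y)
            if candidate > current:
                current = candidate
            else:
                mask[i] = not mask[i]  # no improvement: undo the move
        if current > best_score:
            best_mask, best_score = list(mask), current
    return best_mask, best_score


def toy_links(n_true, n_false, rng):
    """Synthetic one-feature links (a made-up similarity score), NOT real data."""
    xs = [[rng.gauss(0.7, 0.2)] for _ in range(n_true)]
    xs += [[rng.gauss(0.3, 0.2)] for _ in range(n_false)]
    return xs, [1] * n_true + [0] * n_false


if __name__ == "__main__":
    rng = random.Random(42)
    train_x, train_y = toy_links(5, 30, rng)
    test_x, test_y = toy_links(5, 30, rng)
    mask, score = hill_climb(train_x, train_y, test_x, test_y)
    print(f"kept {sum(mask)} of {len(mask)} training links, G-metric {score:.2f}")
```

Each move adds or drops a single training link and is kept only if the score on the test set improves, which mirrors the idea of discarding links that add noise. Scoring directly on the test set follows the formulation in the abstract; a separate validation split would be the more cautious choice in practice.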
Similar resources
An Ontological Approach for the Semantic Recovery of Traceability Links between Software Artifacts
Traceability links provide support for software engineers in understanding relations and dependencies among software artifacts created during the software development process. In this research, we focus on reestablishing traceability links between existing source code and documentation to support software maintenance. We present a novel approach that addresses this issue by creating formal onto...
Evaluation of traceability recovery in context: A taxonomy for information retrieval tools
Background: Development of complex, software intensive systems generates large amounts of information. Several researchers have developed tools implementing information retrieval (IR) approaches to suggest traceability links among artifacts. Aim: We explore the consequences of the fact that a majority of the evaluations of such tools have been focused on benchmarking of mere tool output. Method...
UNICASE Trace Client: A CASE Tool Integrating Requirements Engineering, Project Management and Code Implementation
Artifacts for requirements engineering, project management and code implementation are usually stored in separate tools, which makes traceability between these artifacts difficult. We developed the tool UNICASE Trace Client, which stores the aforementioned artifacts in a single environment with full traceability between all artifacts. In this paper, we describe the three traceability link creat...
Tracing requirements and source code during software development
Traceability links between requirements and source code are often created after development. This reduces the possibilities for developers to use these traceability links during the development process. Additionally, existing approaches applied after development do not consider artifacts from project management, which are used for planning and organizing a project. These artifacts can serve as ...
Towards a Meta Aspect for Traceability
Crosscutting concerns are represented in different ways using different artefacts throughout the life cycle. The evolution of crosscutting concerns must be controlled to guarantee the consistency of requirements and to avoid costly rework. This control capability is, in essence, traceability. This paper describes an initial general vision of how traceability of crosscutting concerns can be supp...
Publication date: 2015